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Abstract 

We study partitions of the symmetric group which have desirable geometric properties. 
The statistical tests defined by such partitions involve counting all permutations in the 
equivalence classes. These permutations are the linear extensions of partially ordered sets 
specified by the data. Our methods refine rank tests of non-parametric statistics, such 
as the sign test and the runs test, and are useful for the exploratory analysis of ordinal 
data. Convex rank tests correspond to probabilistic conditional independence structures 
known as semi-graphoids. Submodular rank tests are classified by the faces of the cone of 
submodular functions, or by Minkowski summands of the permutohedron. We enumerate 
all small instances of such rank tests. Graphical tests correspond to both graphical models 
and to graph associahedra, and they have excellent statistical and algorithmic properties. 



1 Introduction 

The non-parametric approach to statistics was 
introduced by (Pitman, 1937). The emergence 
of microarray data in molecular biology has led 
to a number of new tests for identifying signifi- 
cant patterns in gene expression time series; see 
e.g. (Willbrand, 2005). This application moti- 
vated us to develop a mathematical theory of 
rank tests. We propose that a rank test is a 
partition of S n induced by a map t : S n > T 
from the symmetric group of all permutations of 
[n] = {1, . . . , n} onto a set T of statistics. The 
statistic t(tt) is the signature of the permuta- 
tion 7r € S n . Each rank test defines a partition 
of S n into classes, where n and ir 1 are in the 
same class if and only if t(tt) = t(tt'). We iden- 
tify T = image(r) with the set of all classes in 
this partition of S n . Assuming the uniform dis- 
tribution on S n , the probability of seeing a par- 
ticular signature t £ T is 1/n! times |r _1 (t)|. 
The computation of a p-value for a given per- 
mutation 7r € S n typically amounts to summing 
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over all permutations ir' with Pr(7r') < Pr(7r). 
In Section 2 we explain how existing rank tests 
can be understood from our point of view. 



In Section 3 we describe the class of con- 
vex rank tests which captures properties of tests 
used in practice. We work in the language of 
algebraic combinatorics (Sta nley, 1997| ). Con- 
vex rank tests are in bijection with polyhedral 
fans that coarsen the hyperplane arrangement 
of S n , and with conditional independence struc- 



tures known as semi-graphoids ( Studeny, 2005 ) 



Section 4 is devoted to convex rank tests that 
are induced by submodular functions. These 
submodular rank tests are in bijection with 
Minkowski summands of the (n— l)-dimensional 
permutohedron and with structural imset mod- 
els. Furthermore, these tests are at a suitable 
level of generality for the biological applications 
that motivated us. We make the connections 
to polytopes and independence models concrete 
by classifying all convex rank tests for n < 5. 

In Section 5 we discuss the class of graphi- 
cal tests. In mathematics, these correspond to 
graph associahedra, and in statistics to graphi- 
cal models. The equivalence of these two struc- 
tures is shown in Theorem 1181 The implemen- 
tation of convex rank tests requires the efficient 
enumeration of linear extensions of partially or- 
dered sets (posets). A key ingredient is a highly 
optimized method for computing distributive 
lattices. Our software is discussed in Section 6. 



2 Rank tests and posets 

A permutation tt in S n is a total order on [n] = 
{l,...,n}. This means that tt is a set of 
ordered pairs of elements in [n]. If tt and tt' are 
permutations then n Pin' is a partial order. 

In the applications we have in mind, the data 
are vectors u G K n with distinct coordinates. 
The permutation associated with u is the to- 
tal order tt = {(i,j) G [n] x [n] : it, < Uj }. 
We shall employ two other ways of writing 
this permutation. The first is the rank vector 
p = (pi, . . . ,p n ), whose defining properties are 
{pi, . . . , p n } = [n] and pi < pj if and only if 
Ui < Uj. That is, the coordinate of the rank 
vector with value i is at the same position as the 
ith smallest coordinate of u. The second is the 
descent vector 6 = (Si, . . . , 8 n ), defined by ug i > 
u$ i+1 . The ith coordinate of the descent vector 
is the position of the ith largest value of u. For 
example, if u = (11,7, 13) then its permutation 
is represented by tt = {(2, 1), (1, 3), (2, 3)}, by 
p= (2,1,3), or by S = (3, 1,2). 

A permutation tt is a linear extension of a 
partial order P on [n] if P C tt. We write 
C(P) C S n for the set of linear extensions of P. 
A partition r of the symmetric group S n is a pre- 
convex rank test if the following axiom holds: 



(PC) 



If r(vr) = r(vr') and tt" G C(tt n tt') 
then t(tt)=t(tt')=t(tt"). 



Note that tt" G C(tt n tt') means tt n tt' C 7t". 
For n = 3 the number of all rank tests is the Bell 
number Bq = 203. Of these 203 set partitions 
of 5*3, only 40 satisfy the axiom (PC). 

Each class C of a pre-convex rank test r cor- 
responds to a poset P on [n]; namely, P is 
the intersection of all total orders in that class: 
P = Pl^ec 7r - ^ 1 ' ie ax i° m (PC) ensures that C 
coincides with the set C(P) of all linear exten- 
sions of P. The inclusion C C £(P) is clear. For 
the reverse inclusion, note that from any permu- 
tation 7r in C(P), we can obtain any other tt' in 
C(P) by a sequence of reversals (a, b) >— > (6, a), 
where each intermediate 7r is also in C(P). As- 
sume 7To G £(P) and 7Ti G C differ by one rever- 
sal (a,b) G 7Tq, (6, a) G tt\. Then (6, a) ^ P, so 



there is some TT2 G C such that (a, 6) G 7^; thus, 
7ro G £(7ri n 7T2) by (PC). This shows ttq G C. 

A pre-convex rank test is thus an unordered 
collection of posets P±, , . . . , Pj. on [n] that sat- 
isfies the property that S n is the disjoint union 
of the subsets C(Pi), ■ ■ ■ , £>(Pk)- The posets Pj 
that represent the classes in a pre-convex rank 
test capture the shapes of data vectors. 

Example 1 (The sign test for paired data). 

The sign test is performed on data that are 
paired as two vectors u = (ui,U2, ■ ■ ■ , u m ) and 
v = (v±, v<2, . . . ,v m ). The null hypothesis is that 
the median of the differences ui — Vi is 0. The 
test statistic is the number of differences that 
are positive. This test is a rank test, because u 
and v can be transformed into the overall ranks 
of the n = 2m values, and the rank vector en- 
tries can then be compared. This test coarsens 
the convex rank test which is the MSS of Section 
4 with K = {{1, m + 1}, {2, m + 2}, . . . }. 

Example 2 (Runs tests). A runs test can be 
used when there is a natural ordering on the 
data points, such as in a time series. The data 
are transformed into a sequence of 'pluses' and 
'minuses,' and the null hypothesis is that the 
number of observed runs is no more than that 
expected by chance. A runs test is a coars- 
ening of the convex rank test r described in 
(Will brand, 20051 Section 6.1.1) and in Exam- 



ple n 

These two examples suggest that many tests 
from classical statistics have a natural refine- 
ment by a pre-convex rank test. The term 
"pre-convex" refers to the following interpreta- 
tion of the axiom (PC). Consider any two vec- 
tors u and u' in R 71 , and a convex combination 
u" = An + (1 - A>', with < A < 1. If 
tt,tt',tt" are the permutations of u,u',u" then 
tt" G C(tt n tt'). Thus the regions in 1" speci- 
fied by a pre-convex rank test are convex cones. 

3 Convex rank tests 

A fan in R n is a finite collection T of polyhe- 
dral cones which satisfies the following proper- 
ties: (i) if C G J- and C is a face of C, then 
C G (ii) If C, C G T, then C n C is a face 



of C. Two vectors u and v in R n are permu- 
tation equivalent when U{ < Uj if and only if 
Vi < Vj , and U{ = Uj if and only if = vj for all 
i,j € [n]. The permutation equivalence classes 
(of which there are 13 for n = 3) induce a fan 
which we call the S n -fan. The maximal cones 
in the 5 ra -fan, which are the closures of the per- 
mutation equivalence classes corresponding to 
total orders, are indexed by permutations 5 in 
S n . A coarsening of the S^-fan is a fan T such 
that every permutation equivalence class of W 1 
is fully contained in a cone C of J-; J- defines 
a partition of S n because each maximal cone of 
the S'n-fan is contained in some cone C £ J-. 
We define a convex rank test to be a partition 
of S n defined by a coarsening of the £ n -fan. We 
identify the fan with that test. 

Two maximal cones of the S^-fan share a wall 
if there exists an index k such that 5k = 8[ 



fe+l' 



8k+i = 8'k and 8i = 5[ for % ^ k, k + 1. That is, 
the corresponding permutations 5 and 5' differ 
by an adjacent transposition. To such an un- 
ordered pair {5,5'}, we associate the following 
conditional independence (CI) statement: 

8k -LL 6 k +i | {<5i, . . . ,5 k -i}. (2) 

This formula defines a map from the set of walls 
of the Sn-fan onto the set of all CI statements 



{% ALj\K : KQ[n]\{i,j}}. 



The map from walls to CI statements is not in- 
jective; there are (n — k — l)\(k — 1)\ walls which 
are labelled by the statement (|2"|). 

Any convex rank test T is characterized by 
the collection of walls {5,5'} that are removed 
when passing from the S* n -fan to T . So, from 
(J2J), any convex rank test T maps to a set M.? 
of CI statements corresponding to missing walls. 



Recall from flMatus, 2004| ) and JStudeny, 2005 ) 
that a subset M of T n is a semi-graphoid if the 
following axiom holds: 

iALj\KUi £ M and i AL£\K eM 
implies i _LL j \ K G M and i _LL £ \ KUj € M. 



Theorem 3. The map T > is a bijection 
between convex rank tests and semi-graphoids. 
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Figure 1: The permutohedron P3 and the Sa- 
fari projected to the plane. Each permutation 
is represented by its descent vector 5 = 5i<^2^3- 

Example 4 (Up-down analysis for n = 3). 

The test in flWillbrand, 2 005) is a convex rank 
test and is visualized in Figure 1. Permutations 
are in the same class if they are connected by a 
solid edge; there are four classes. In the S^-fan, 
the two missing walls are labeled by conditional 
independence statements as defined in @. 



Example 5 (Up-down analysis for n = 4). 

The test T in ( |Willbrand, 20051 ) is shown in Fig- 
ure 2. The double edges correspond to the 12 
CI statements in Mp. There are 8 classes; e.g., 
the class {3412,3142,1342,1324,3124} consists 
of the 5 permutations with up-down pattern 

(-+,-)■ 

Our proof of Theorem El rests on translating 
the semi-graphoid axiom for a set of CI state- 
ments into geometric statements about the cor- 
responding set of edges of the permutohedron. 

The S^-fan is the normal fan (Ziegler, 1995) 



of the permutohedron P n , which is the convex 
hull of the vectors (pi, . . . ,p n ) E W 1 , where p 
runs over all rank vectors of permutations in 
S n . The edges of P n correspond to walls and 
are thus labeled with CI statements. A collec- 
tion of parallel edges of P n perpendicular to a 



hyper plane Xi 



corresponds to the set of 



CI statements i _LL j\K, where K ranges over 
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Figure 2: The permutohedron P4 with vertices 
marked by descent vectors 5. The test "up- 
down analysis" is indicated by the double edges. 

all subsets of j}. The two-dimensional 

faces of P n are squares and regular hexagons, 
and two edges of P n have the same label in T n 
if, but not only if, they are opposite edges of a 
square. A semi-graphoid Ai can be identified 
with the set M of edges with labels from A4. 
The semi-graphoid axiom translates into a geo- 
metric condition on the hexagonal faces of P n . 

Observation 6. A set M of edges of the per- 
mutohedron P n is a semi-graphoid if and only 
if M satisfies the following two axioms: 
Square axiom: Whenever an edge of a square 
is in M ; then the opposite edge is also in M. 
Hexagon axiom: Whenever two adjacent edg- 
es of a hexagon are in M, then the two opposite 
edges of that hexagon are also in M. 

Let M be the subgraph of the edge graph of 
P n defined by the statements in A4. Then the 
classes of the rank test defined by M are given 
by the permutations in the path-connected com- 
ponents of M. We regard a path from 5 to 
5' on P„ as a word ■ ■ ■ in the free as- 
sociative algebra A generated by the adjacent 
transpositions of [n]. For example, the word 
o~23 '■= (23) gives the path from 5 to 5' = 0238 = 
<5i^3^2^4 • • • b~n- The following relations in A de- 



fine a presentation of the group algebra of S n : 

(BS) a{ : i + iai + k+l,i+k+2 — 0~i+k+l,i+k+20~i,i+l, 
(BH) CTj j j + iCTj + l j j + 20'j,j+l — CTj + l i j + 2CTj ) j + l(Tj + l ) j_|_2, 

(BN) 4 +1 - 1, 

where suitable i and k vary over [n]. The first 
two are the braid relations, and the last repre- 
sents the idempotency of each transposition. 

Now, we regard these relations as proper- 
ties of a set of edges of P n , by identifying a 
word and a permutation d with the set of edges 
that comprise the corresponding path in P n . 
For example, a set satisfying (BS) is one such 
that, starting from any 5, the edges of the path 
crj i j_|_icrj_|_fc + i j j_|_fc_|_2 are in the set if and only if the 
edges of the path cri+k+i,i+k+20~i t i+i are in the 
set. Note then, that (BS) is the square axiom, 
and (BH) is a weakening of the hexagon ax- 
iom of semi-graphoids. That is, implications in 
either direction hold in a semi-graphoid. How- 
ever, (BN) holds only directionally in a semi- 
graphoid: if an edge lies in the semi-graphoid, 
then its two vertices are in the same class; but 
the empty path at some vertex 5 certainly does 
not imply the presence of all incident edges in 
the semi-graphoid. Thus, for a semi-graphoid, 
we have (BS) and (BH), but must replace (BN) 
with the directional version 

(BN') 4 +1 - 1. 

Consider a path p from 5 to 5' in a semi- 
graphoid. A result of flTits, 1968D gives the fol- 
lowing lemma; see also ( [Brown, 19891 P- 49-51). 

Lemma 7. If M is a semi-graphoid, then if 5 
and 5' lie in the same class of A4, then so do 
all shortest paths on P n between them. 

We are now equipped to prove Theorem 03 
Note that we have demonstrated that semi- 
graphoids and convex rank tests can be re- 
garded as sets of edges of P n , so we will show 
that their axiom systems are equivalent. We 
first show that a semi-graphoid satisfies (PC). 
Consider 5, 5' in the same class C of a semi- 
graphoid, and let 5" £ C(8, 5'). Further, let p be 
a shortest path from 5 to 5" (so, p5 = 5"), and 
let q be a shortest path from 5" to 5' . We claim 



that qp is a shortest path from 5 to 5', and thus 
5" £ C by LemmaEJ Suppose qp is not a short- 
est path. Then, we can obtain a shorter path in 
the semi-graphoid by some sequence of substitu- 
tions according to (BS), (BH), and (BN'). Only 
(BN') decreases the length of a path, so the se- 
quence must involve (BN'). Therefore, there is 
some i, j in [n], such that their positions rela- 
tive to each other are reversed twice in qp. But 
p and q are shortest paths, hence one reversal 
occurs in each p and q. Then 5 and 5' agree on 
whether i > j or j > i, but the reverse holds 
in 5", contradicting 5" € C(S,8'). Thus every 
semi-graphoid is a pre-convex rank test. 

Now, we show that a semi-graphoid corre- 
sponds to a fan. Consider the cone corre- 
sponding to a class C. We need only show 
that it meets any other cone in a shared face. 
Since C is a cone of a coarsening of the S^-fan, 
each nonmaximal face of C lies in a hyperplane 
H = {xi = Xj}. Suppose a face of C coin- 
cides with the hyperplane H and that i > j in 
C . A vertex 5 borders H if i and j are adja- 
cent in 5. We will show that if 5,5' € C bor- 
der H, then their reflections 5 = 5± . . . ji . . . 5 n 
and 5' = 5[ . . . ji . . . 5' n both lie in some class 
C . Consider a 'great circle' path between 5 
and 5' which stays closest to H: all vertices in 
the path have i and j separated by at most one 
position, and no two consecutive vertices have i 
and j nonadjacent. This is a shortest path, so 
it lies in C, by Lemma Using the square and 
hexagon axioms (Observation EJ, we see that 
the reflection of the path across H is a path in 
the semi-graphoid that connects 5 to 5' (Figure 
3). Thus a semigraphoid is a convex rank test. 



5' 



show that M satisfies the square and hexagon 
axioms. This completes the proof of Theorem^! 

Remark 8. For n = 3 there are 40 pre-convex 
rank tests, but only 22 of them are convex rank 
tests. The corresponding CI models are shown 



Figure 3: Reflecting a path across a hyperplane. 

Finally, if M is a set of edges of P n , repre- 
senting a convex rank test, then it is easy to 



in Figure 5.6 on page 108 in (Studeny, 2005). 
4 The submodular cone 

In this section we examine a subclass of the con- 
vex rank tests. Let 2^ denote the collection 
of all subsets of [n] = {1,2,..., n}. Any real- 
valued function w : 2^ — > R defines a convex 
polytope Q w of dimension < n — 1 as follows: 

Q w := {i£K™ : x\ + x 2 H h x n = w([n]) 

and T,iei x i ^ W ( I ) for all / / C [n] }. 

A function w : 2^ — > R is called submodular if 
w(I)+w{J) > w(InJ)+w(IUJ) forl.JC [n]. 

Proposition 9. A function w : 2^ — > R is 
submodular if and only if the normal fan of the 
polyhedron Q w is a coarsening of the S n -fan. 

This follows from greedy maximization as in 
( Lovasz, 1 983). Note that the function w is sub- 
modular if and only if the optimal solution of 

maximize u ■ x subject to x € Q w 

depends only on the permutation equivalence 
class of u. Thus, solving this linear program- 
ming problem constitutes a convex rank test. 
Any such test is called a submodular rank test. 

A convex polytope is a (Minkowski) summand 
of another polytope if the normal fan of the lat- 
ter refines the normal fan of the former. The 
polytope Q w that represents a submodular rank 
test is a summand of the permutohedron P n . 

Theorem 10. The following combinatorial 
objects are equivalent for any positive integer n: 

1. submodular rank tests, 

2. summands of the permutohedron P n? 

3. structural conditional independence models, 

4. faces of the submodular cone C n in R 2 ™ . 

We have 1 <^=^ 2 from Proposition and 
1 <S=^ 3 follows from (Stud eny, 20 05). Further 
3 <^=> 4 holds by definition. 



The submodular cone is the cone C n of all 
submodular functions w : 2^ — > R. Work- 
ing modulo its lineality space C n D (— C n ), 
we regard C n as a pointed cone of dimension 
2 n -n-1. 

Remark 11. All 22 convex rank tests for n = 3 
are submodular. The submodular cone C3 is a 
4-dimensional cone whose base is a bipyramid. 
The polytopes Q w , as w ranges over the faces 
of C3, are all the Minkowski summands of P3. 
Proposition 12. For n > 4, there exist convex 
rank tests that are not submodular rank tests. 
Equivalently, there are fans that coarsen the S n - 
fan but are not the normal fan of any polytope. 
This result is stated in Section 2.2.4 of 



( Studeny, 2005 ) in the following form: "There 
exist semi-graphoids that are not structural." 

We answered Question 4.5 posed in 
( Postnikov, 20061 ) by finding a non-submodular 
convex rank test in which all the posets Pj are 
trees: 

M = {2iL3|{l,4}, liL4|{2,3}, 
1 _U_ 2|0, 3 _U_ 4|0}. 

Remark 13. For n = 4 there are 22108 sub- 
modular rank tests, one for each face of the 11- 
dimensional cone C4. The base of this submod- 
ular cone is a polytope with /-vector (1, 37, 356, 
1596, 3985, 5980, 5560, 3212, 1128, 228, 24, 1). 

Remark 14. For n = 5 there are 117978 
coarsest submodular rank tests, in 1319 
symmetry classes. We confirmed this 
result of flStudeny, 2000) ) with PDLYMAKE 
( Gawrilow, 2000| ). 

We now define a class of submodular rank 
tests, which we call Minkowski sum of simplices 
(MSS) tests. Note that each subset K of [n] 
defines a submodular function wk by setting 
wk(I) = 1 if KC\I is non-empty and wk{I) = 
if K n I is empty. The corresponding polytope 
Qw K is the simplex Ak = convjefc : k S K}. 

Now consider an arbitrary subset K, = 
{K u K 2 , . . . , K r } of 2N . It defines the submod- 
ular function wjc = wk 1 + wk 2 + ' ' ' + w K r ■ The 
corresponding polytope is the Minkowski sum 



The associated MSS test t/c is defined as follows. 
Given p G S n , we compute the number of indices 
j € [r] such that maxjpfc : k £ Kj} = pi, for 
each i € [n\. The signature Tjc{p) is the vector 
in N n whose ith coordinate is that number. Few 
submodular rank tests are MSS tests: 

Remark 15. For n = 3, among the 22 sub- 
modular rank tests, only 15 are MSS tests. For 
n = 4, among the 22108, only 1218 are MSS. 

5 Graphical tests 

Graphical models are fundamental in statistics, 
and they also lead to a useful class of rank tests. 
First we show how to associate a semi-graphoid 
to a family K. Let J~ WK be the normal fan of 
Qw K - We write M.^. for the CI model derived 
from J- Wic using the bijection in Theorem |5J 

Proposition 16. The semi-graphoid Mk, is the 
set of CI statements i _LL j \ K which satisfy the 
following property: all sets containing {i, j} and 
contained in {i,j} U are not in JC. 

Let G be a graph with vertex set [n]. We 
define KL{G) to be the collection of all subsets K 
of [n] such that the induced subgraph of G\k is 
connected. Recall that the undirected graphical 
model (or Markov random field) derived from 
the graph G is the set AA G of CI statements: 

A4 G = {i AL j \ C : the restriction of G to 
[n]\C contains no path from i to j}. 

The polytope = Ajc(G) 1S the graph asso- 
ciahedron, which is a well-studied object in com- 
binatorics dCarr, 2004[ |Postnikov, 20051 ). The 
next theorem is derived from Proposition 1161 

Theorem 17. The CI model induced by the 
graph associahedron coincides with the graphi- 



cal model M. , i.e., A4 



K{G) 



M 



G 



There is a natural involution * on the set of 
all CI statements which is defined as follows: 



(iALj\CY 



iALj\[n]\(CU{i,j}). 



If M is any CI model, then the CI model M* is 
obtained by applying the involution * to all the 
CI statements in the model M. Note that this 



A Kl + A K2 + • • • + A 



involution was called duality in ( Matus, 1992) 
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Figure 4: The permutohedron P4 . Double edges 
indicate the test twg<) when G is the path. 
Edges with large dots indicate the test T^ G y 

The graphical tubing rank test t^ g ^ is the test 
associated with M^ G y It can be obtained by 
a construction similar to the MSS test tjc, with 
the function wk, defined differently and super- 
modular. The graphical model rank test TcfG?) is 
the MSS test of the set family K,(G). 

We next relate t^q^ and t^, q s to a known 
combinatorial characterization of the graph as- 
sociahedron A^r. Two subsets A,B C [n] are 
compatible for the graph G if one of the fol- 
lowing conditions holds: A C B, B C A, or 
Afl5 = 0, and there is no edge between any 
node in A and B. A tubing of the graph G is a 
subset T of 2^ such that any two elements of 
T are compatible. Carr and Devadoss (2005) 
showed that Aq is a simple polytope whose 
faces are in bijection with the tubings. 

Theorem 18. The following four combinatorial 
objects are isomorphic for any graph G on [n] : 

• the graphical model rank test t^/q), 

• the graphical tubing rank test T^ G y 

• the fan of the graph associahedron Aq, 

• the simplicial complex of all tubings on G. 

The maximal tubings of G correspond to ver- 
tices of the graph associahedron A^. When 
G is the path of length n, then Ac is the 



associahedron, and when it is a cycle, A<^ is 
the cyclohedron. The number of classes in the 
tubing test t^,^ is the G-Catalan number of 



( Postnikov, 20051 ). This number is ^x( 2 ™) for 

for the cyclo- 



the associahedron test and 
hedron test. 



(2n-2\ 
{ n.-l ) 



6 Enumerating linear extensions 

In this paper we introduced a hierarchy of 
rank tests, ranging from pre-convex to graph- 
ical. Rank tests are applied to data vectors 
u € R n , or permutations it € S n , and locate 
their cones. In order to determine the signifi- 
cance of a data vector, one needs to compute the 
quantity |t -1 (t(7t)) |, and possibly the proba- 
bilities of other maximal cones. These cones are 
indexed by posets P\ , P2, ■ ■ ■ , Pfe on [n] , and the 
probability computations are equivalent to find- 
ing the cardinality of some of the sets C{P{). 

We now present our methods for computing 
linear extensions. If the rank test is a tubing 
test then this computation is done as follows. 
From the given permutation, we identify its sig- 
nature (image under r), which we may assume 



is its G-tree T ( |Postnikov, 2005| ). Suppose the 
root of the tree T has k children, each of which 
is a root of a subtree T* for i = 1, . . . , k. Writing 
IT* I for the number of nodes in T\ we have 



.-1 



(T) 




This recursive formula can be translated 
into an efficient iterative algorithm. In 



(Will brand, 20 05) the analogous problem is 
raised for the test in Example 3. A determi- 
nantal formula for (^Q) appears in dStanley, 19971 
page 69). 

For an arbitrary convex rank test we proceed 
as follows. The test is specified (implicitly or 
explicitly) by a collection of posets P\,...,P^ 
on [n]. From the given permutation, we first 
identify the unique poset Pi of which that per- 
mutation is a linear extension. We next con- 
struct the distributive lattice L(P{) of all order 
ideals of Pi . Recall that an order ideal is a sub- 
set O of [n] such that if I G O and (k, I) G Pi 



then k £ O. The set of all order ideals is a lat- 
tice with meet and join operations given by set 
intersection OCiO' and set union OUO'. Knowl- 
edge of this distributive lattice L(Pi) solves our 
problem because the linear extensions of Pi are 
precisely the maximal chains of L{Pi). Com- 
puting the number of linear extensions is #P- 



complete (Bright well, 1991 ). Therefore we de- 
veloped efficient heuristics to build L(Pi). 

The key algorithmic task is the following: 
given a poset Pi on [n], compute an efficient 
representation of the distributive lattice L(Pi). 
Our program for performing rank tests works 
as follows. The input is a permutation tt and a 
rank test r. The test r can be specified either 

• by a list of posets P\ , . . . , Py. (pre-convex) , 

• or by a semigraphoid A4 (convex rank test), 

• or by a submodular function w : 2^ — > R, 

• or by a collection K, of subsets of [n] (MSS), 

• or by a graph G on [n] (graphical test). 



The output of our program has two parts. First, 
it gives the number |>C(Pj)| of linear extensions, 
where the poset P, represents the equivalence 
class of S n specified by the data tt. It also 
gives a representation of the distributive lat- 
tice L(Pi), in a format that can be read by 
the maple package posets (Stembridge, 2004). 
Our software for the above rank tests is available 
at www.bio.math.berkeley.edu/ranktests/. 
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